Coordinate Refinement on All Atoms of the Protein Backbone with Support Vector Regression
نویسندگان
چکیده
For the past decades, many efforts have been made in the fields of protein structure prediction. Among these, the protein backbone reconstruction problem (PBRP) has attracted much attention. The goal of PBRP is to reconstruct the 3D coordinates of all atoms along the protein backbone for given a target protein sequence and its Cα coordinates. In order to improve the prediction accuracy, we attempt to refine the 3D coordinates of all backbone atoms by incorporating the stateof-the-art prediction softwares and support vector regression (SVR). We use the predicted coordinates of two excellent methods, PD2 and BBQ, as our feature candidates. Accordingly, we define more than 100 possible features. By means of the correlation analysis, we can identify several significant features deeply related to the prediction target. Then, a 5-fold cross validation is carried out to perform the experiments, in which the involved datasets range from CASP7 to CASP11. As the experimental results show, our method yields about 8% improvement in RMSD over PD2, which is the most accurate predictor for the problem.
منابع مشابه
Prediction of Protein Backbone Structure by Preference Classification with SVM
Given the primary sequence of a protein and its α-carbon coordinates, the allatom protein backbone reconstruction problem (PBRP) is to reconstruct the 3D coordinates of all atoms, including N, C, and O atoms on the backbone. A variety of methods for solving PBRP have been proposed, such as Adcock’s method, SABBAC, BBQ, and Chang’s methods. In this paper, we involve BBQ (Backbone Building from Q...
متن کاملRefinement on O Atom Positions for Protein Backbone Prediction
For given the 3D coordinates of Cα in a protein, the all-atom protein backbone reconstruction problem (PBRP) is to rebuild the 3D coordinates of all major atoms (N, C and O) on the backbone. The previous works show that the prediction accuracy of the 3D positions of the O atoms is not so good, compared with the other two atoms N and C. Thus, we aim to refine the positions of the O atoms after t...
متن کاملProtein Backbone Reconstruction with Tool Preference Classification for Standard and Nonstandard Proteins
Given a protein sequence and the Cα coordinates on its backbone, the all-atom protein backbone reconstruction problem (PBRP) is to reconstruct the backbone by its 3D coordinates of N, C and O atoms. In the past few decades, many methods have been proposed for solving PBRP. Related research reveals that if proper prediction tools are selected to build the 3D coordinates of the desired atoms, the...
متن کاملSupport vector regression with random output variable and probabilistic constraints
Support Vector Regression (SVR) solves regression problems based on the concept of Support Vector Machine (SVM). In this paper, a new model of SVR with probabilistic constraints is proposed that any of output data and bias are considered the random variables with uniform probability functions. Using the new proposed method, the optimal hyperplane regression can be obtained by solving a quadrati...
متن کاملPrediction of daily evaporation using hybrid support vector regression-firefly optimization algorithm and multilayer perceptron
Prediction of daily evaporation is a valuable and determinant tool in sustainable agriculture and hydrological issues, especially in the design and management of water resources systems. Therefore, in this study, the ability of artificial intelligence models of multi-layer perceptron (MLP), support vector regression (SVR), and the hybrid model of support vector regression-firefly optimization a...
متن کامل